Shared-memory multiprocessor system and method for processing information

ABSTRACT

Large-scale table data stored in a shared memory are sorted by a plurality of processors in parallel. According to the present invention, the records subjected to processing are first divided for allocation to the plurality of processors. Then, each processor counts the numbers of local occurrences of the field value sequence numbers associated with the records to be processed. The numbers of local occurrences of the field value sequence numbers counted by each processor is then converted into global cumulative numbers, i.e., the cumulative numbers used in common by the plurality of processors. Finally, each processor utilizes the global cumulative numbers as pointers to rearrange the order of the allocated records.

TECHNICAL FIELD

The present invention relates to a method for processing information ina shared-memory multiprocessor system in which a plurality of processorsshare a memory to perform parallel processing, and particularly relatesto an information processing method for sorting large-scale table datain the shared memory by use of the plurality of processors.

The present invention also relates to a shared-memory multiprocessorsystem that performs such information processing method.

The present invention further relates to a program for performing suchinformation processing method.

The present invention further relates to a memory medium storing suchprogram.

BACKGROUND ART

In this day and age when computers are used in every corner of societyand networks such as the Internet are widely used, the storing andprocessing of large-scale data has become common occurrence.

There has been an effort to develop efficient algorithms for processinglarge-scale data. Processing that is frequently required whenlarge-scale data, especially large-scale table data, is processed issorting. As efficient sorting algorithms, radix sort and counting sort(also referred to as distribution counting sort) are known. Countingsort may be utilized for the sorting of each digit in radix sort.Although counting sort is an efficient algorithm, its application islimited by the requirement of the following conditions:

1) objects to be sorted are integers;

2) the upper limit and lower limit of the integers to be sorted areknown; and

3) a difference between the upper limit and lower limit of the integersto be sorted is not exceedingly large.

Against this background, the inventor of the present invention hasdeveloped a data management mechanism that is suitable for thehigh-speed searching, summarizing, and sorting of large-scale table data(see Patent Document 1). This data management mechanism uses aninformation block for representing the individual field values of afield included in table data. In this information block, field valuesbelonging to a field of the table data are represented by field valuesequence numbers assigned to the respective field values and an array ofthe actual field values arranged in an order of field value sequencenumbers. An array is provided in which the field value sequence numberscorresponding to the field values of respective records are arranged inthe order of record numbers. The field value of a given record isidentified by finding the value corresponding to the field valuesequence number of this given record in the array of the field values.Further, a record to be processed in table data is identified by use ofan array in which record numbers are arranged in sequence.

The information block is a table in which field values corresponding tofield value sequence numbers are stored in the order of field valuesequence numbers with respect to each field of the table data, whereinthe field value sequence numbers represent the sequencing of fieldvalues (i.e., assigning integers to field values) belonging to a givenfield. The field values may be any type of data such as numerical values(integer, fixed point, floating point, or the like) or characterstrings. This data management mechanism has an advantage in that a valueof any data type can be treated as an integer that is a field valuesequence number. Namely, when character-string data are to be sortedaccording to this data management mechanism, for example,character-string data are not the actual sort objects subjected tosorting, but field value sequence numbers corresponding to the values ofcharacter-string data are the actual sort objects that are to be sorted.In so doing, the results of sorting are represented by an array in whichrecord numbers are arranged in sequence. In this manner, theinformation-block-based data management mechanism developed by theinventor of the present invention is advantageous in that theabove-noted conditions 1) through 3) required for the application ofcounting sort are satisfied.

There has also been an effort to introduce parallel processing in orderto perform a vast amount of computation at high speed that is necessaryfor the purpose of processing large-scale data. For the sorting purpose,also, various parallel sorting algorithms have been developed. Ingeneral, parallel processing architectures are classified manly intodistributed-memory type and shared-memory type. In the case of thedistributed-memory type, a system is configured by combining a pluralityof processors each having a local memory. In this arrangement, it ispossible in theory to design a hardware system incorporating hundreds totens of thousands of processes. However, the distributed-memory type isfraught with technical problems such as the complexity of distributeddata management and the low efficiency of processor-to-processorcommunication. In the shared-memory type, on the other hand, a pluralityof processors share a single large memory space. In this arrangement,traffic between a set of processors and the shared memory becomesbottleneck, so that it is believed to be difficult to implement a systemincorporating more than 100 processors in practice.

Against this background, it is now possible to obtain a personalcomputer that is implemented as a shared-memory multiprocessor systemusing a plurality of CPUs. Typical CPUs used in such a personal computeroperate with internal clock that is five to six times faster than theclock of the memory bus, and are provided with an automatic parallelprocessing mechanism and pipeline processing mechanism embedded therein,thereby processing one item of data in one clock cycle (i.e., memory busclock cycle).

[Patent Document 1] International Publication WO00/10103

DISCLOSURE OF INVENTION Problem to be Solved by the Invention

It is thus desired to combine an efficient sorting algorithm and ashared-memory multiprocessor system in order to process large-scaletable data.

Counting sort that is known to be an efficient sorting algorithm islimited as to its application by the requirement of the above-notedconditions 1) through 3). Because of this, it is difficult to applycounting sort to the processing of large-scale table data unless theinformation-block-based data management mechanism developed by theinventor of the present invention is utilized. It should be noted thatthe technology for performing parallel sorting on large-scale table datain a shared-memory multiprocessor system are not known to date.

Accordingly, it is an object of the present invention to utilize theinformation-block-based data management mechanism so as to provide aninformation processing method for sorting large-scale table data storedin the shared memory in a parallel manner by use of a plurality ofprocessors.

Further, it is another object of the present invention to provide ashared-memory multiprocessor system that performs such informationprocessing method.

Moreover, it is a further object of the present invention to provide aprogram for performing such information processing method.

Also, it is another object of the present invention to provide a memorymedium having such a program recorded therein.

Means to Solve the Problem

The present invention is relied on a data management mechanism havingits basis on the information block that is a table in which field valuescorresponding to field value sequence numbers are stored in the order(either ascending order or descending order) of field value sequencenumbers with respect to each field of the table data, wherein the fieldvalue sequence numbers represent the sequencing of field values (i.e.,assigning integers to field values) belonging to a given field. Thefield values may be any type of data such as numerical values (integer,fixed point, floating point, or the like) or character strings. With theuse of this data management mechanism, a value of any data type can betreated as an integer that is a field value sequence number. Namely,when data of any given type are to be sorted according to this datamanagement mechanism, the data of this type are not the actual sortobjects subjected to sorting, but field value sequence numberscorresponding to the values of this data are the actual sort objectsthat are to be sorted. Accordingly, the data management mechanism basedon this information block satisfies the requirements for application ofthe counting sort. Further, since a record to be processed in table datais identified by use of an array in which record numbers are arranged insequence, the results of sorting will be represented by an array inwhich record numbers are arranged in sequence.

The present invention applies such data management mechanism to ashared-memory multiprocessor system, thereby achieving an informationprocessing method for sorting large-scale table data in the sharedmemory by a plurality of processors in parallel, and also achieving ashared-memory multiprocessor system performing such informationprocessing method. For this purpose, according to the present invention,the records subjected to processing are first divided for allocation tothe plurality of processors. Then, each processor counts the numbers oflocal occurrences of the field value sequence numbers associated withthe records to be processed. The numbers of local occurrences of thefield value sequence numbers counted by each processor is then convertedinto global cumulative numbers, i.e., the cumulative numbers used incommon by the plurality of processors. Finally, each processor utilizesthe global cumulative numbers as pointers to rearrange the order of theallocated records. According to the present invention, thus, records canbe sorted in parallel with respect to the filed values of a given recordfield (e.g., integers, fixed-point numerical values, floating-pointnumerical values, and character strings) in a shared-memorymultiprocessor system.

A plurality of processors may perform in a parallel fashion theallocation of records to be processed to the plurality of processors,the counting of the numbers of local occurrences, and the rearrangementof the order of the allocated records. The computation of globalcumulative numbers may be performed by utilizing parallel processing bythe plurality of processors, but can be performed by one processor orpart of the processors while maintaining sufficient speed because acache hit rate is high due to sequential accessing of the memory.

The principle of the present invention as described above is implementedby various embodiments as follows.

The first embodiment of the present invention is directed to aninformation processing method for rearranging the order of recordsaccording to the field values of the records in a predetermined field ina shared-memory multiprocessor system. A shared-memory multiprocessorsystem includes a shared memory to store a record number array in whichrecord numbers of table data records are stored according to apredetermined record order, a field value sequence number array in whichfield value sequence numbers corresponding to field values of the tabledata records in the predetermined field are stored according to therecord numbers, and a field value array in which the field values of thetable data are stored according to an order of the field value sequencenumbers corresponding to the field values, and further includes aplurality of processors operable to access the shared memory. Theinformation processing method of the present invention includes:

a step of dividing the record number array into portions which areallocated to a first plurality of processors;

a step of counting, by each of the first plurality of processors,numbers of occurrences of the field value sequence numbers correspondingto the records contained in an allocated portion of the record numberarray;

a step of dividing a range of the field value sequence numbers intoportions which are allocated to a second plurality of processors;

a step of converting, by each of the second plurality of processors, thenumbers of occurrences of the field value sequence numbers in anallocated portion into cumulative numbers in an order of the field valuesequence numbers and in an order of the portions of the record numberarray within a range corresponding to the same field value sequencenumber; and

a step of utilizing, by each of the first plurality of processors, aspointers the cumulative numbers of the field value sequence numberscorresponding to the records contained in the allocated portion of therecord number array, thereby storing the record numbers contained in theallocated portion of the record number array in a new record numberarray.

This information processing method achieves parallel processing withrespect to the counting of the numbers of occurrences of field valuesequence numbers, parallel processing with respect to the conversion ofthe numbers of occurrences into cumulative numbers, and parallelprocessing with respect to the generation of a new record number array.The present invention thus expands the technology of counting sort suchthat it is applicable to the shared-memory multiprocessor systemenvironment, thereby achieving the parallel sorting of large-scale tabledata in a shared-memory multiprocessor system. Among the plurality ofprocessors constituting the multiprocessor system, a first plurality ofprocessors selected arbitrarily take care of respective portions of therecode number array, and a second plurality of processes selectedarbitrarily take care of the respective ranges of the field valuesequence numbers. It should be noted that the number of the firstplurality and the number of the second plurality may be equal to thetotal number of processors constituting the multiprocessor system, ormay be only part of these.

The information processing method of the present invention may utilizethe concept of radix sort with respect to the field value sequencenumbers, thereby achieving the multi-stage parallel sorting oflarge-scale table data in a shared-memory multiprocessor system. If thesize of the field value sequence number array is large, for example, thefield value sequence number array may be compressed to improve theefficiency of processing. To this end, the information processing methodof the present invention includes:

a step of selecting radix representation of the field value sequencenumbers in response to a range of the field value sequence numbers; and

a step of repeating sorting with respect to a digit of interest that isselected successively from a least significant digit to a mostsignificant digit in the radix representation of the field valuesequence numbers, by use of the record number array as a current recordnumber array for a first time sorting and by use of a new record numberarray as a current record number array for a second time sorting andonward.

With this arrangement, parallel sorting is performed separately for eachdigit of the field value sequence numbers from the least significantdigit to the most significant digit. The above-noted sorting includes:

a step of dividing the current record number array into portions whichare allocated to a first plurality of processors;

a step of counting, by each of the first plurality of processors,numbers of occurrences of values of the digit of interest in the fieldvalue sequence numbers corresponding to the records contained in anallocated portion of the record number array;

a step of dividing a range of the values of the digit of interest of thefield value sequence numbers into portions which are allocated to asecond plurality of processors;

a step of converting, by each of the second plurality of processors, thenumbers of occurrences of the values of the digit of interest in thefield value sequence numbers in an allocated portion into cumulativenumbers in an order of the values of the digit of interest in the fieldvalue sequence numbers and in an order of the portions of the recordnumber array within a range corresponding to the same value of the digitof interest in the field value sequence number; and

a step of utilizing, by each of the first plurality of processors, aspointers the cumulative numbers of the values of the digit of interestin the field value sequence numbers corresponding to the recordscontained in the allocated portion of the record number array, therebystoring the record numbers contained in the allocated portion of therecord number array in the new record number array.

According to the present invention, sorting is repeated with respect tothe digit of interest selected successively from the least significantdigit to the most significant digit of the field value sequence numbers,so that sorting regarding the field value sequence numbers is achievedin conformity with the concept of radix sort. The parallel sorting oflarge-scale table data is thus achieved in a shared-memorymultiprocessor system.

In the multi-stage parallel sorting described above, the step ofconverting the numbers of occurrences of the values of the digit ofinterest in the field value sequence numbers into cumulative numbers isperformed in parallel by a second plurality of processors. Depending onthe circumstances, however, this step may be performed at high speedwithout parallel processing by a plurality of processors. This isbecause the process of this step is performed sequentially, so that thecache hit rate will be high. In consideration of this, the informationprocessing method of the present invention includes:

a step of selecting radix representation of the field value sequencenumbers in response to a range of the field value sequence numbers; and

a step of repeating sorting with respect to a digit of interest that isselected successively from a least significant digit to a mostsignificant digit in the radix representation of the field valuesequence numbers, by use of the record number array as a current recordnumber array for a first time sorting and by use of a new record numberarray as a current record number array for a second time sorting andonward,

wherein the sorting includes:

a step of dividing the current record number array into portions whichare allocated to a plurality of processors;

a step of counting, by each processor, numbers of occurrences of valuesof the digit of interest in the field value sequence numberscorresponding to the records contained in an allocated portion of therecord number array;

a step of converting, by at least one processor, the numbers ofoccurrences of the values of the digit of interest in the field valuesequence numbers in an allocated portion into cumulative numbers in anorder of the values of the digit of interest in the field value sequencenumbers and in an order of the portions of the record number arraywithin a range corresponding to the same value of the digit of interestin the field value sequence number; and

a step of utilizing, by said each processor, as pointers the cumulativenumbers of the values of the digit of interest in the field valuesequence numbers corresponding to the records contained in the allocatedportion of the record number array, thereby storing the record numberscontained in the allocated portion of the record number array in the newrecord number array.

In this information processing method, the range of the digit ofinterest of field value sequence numbers is not divided for allocationto a plurality of processors, and at least one processor, or preferablyone processor alone, converts the numbers of occurrences of the valuesof digit of interest in field value sequence numbers into cumulativenumbers. In this case also, sorting is repeated with respect to thedigit of interest selected successively from the least significant digitto the most significant digit of the field value sequence numbers, sothat sorting regarding the field value sequence numbers is achieved inconformity with the concept of radix sort. The parallel sorting oflarge-scale table data is thus achieved in a shared-memorymultiprocessor system.

To achieve the objects as described above, the present inventionprovides an information processing method of rearranging an order ofrecords according to field values of the records in a predeterminedfield in a shared-memory multiprocessor system, which includes a sharedmemory to store a record number array in which record numbers of tabledata records are stored according to a predetermined record order, afield value sequence number array in which field value sequence numberscorresponding to field values of the table data records in thepredetermined field are stored according to the record numbers, and afield value array in which the field values of the table data are storedaccording to an order of the field value sequence numbers correspondingto the field values, and further includes a plurality of processorsoperable to access the shared memory. The information processing methodincludes:

a step of dividing the record number array into portions which areallocated to a plurality of processors;

a step of rearranging, by each of the plurality of processors, an orderof the records contained in the allocated portion of the record numberarray according to the field value sequence numbers corresponding to therecords, thereby storing the record numbers of the records in a newrecord number array.

To achieve the objects as described above, further, the presentinvention provides an information processing method of rearranging anorder of records according to field values of the records in apredetermined field in a shared-memory multiprocessor system, whichincludes a shared memory to store a record number array in which recordnumbers of table data records are stored according to a predeterminedrecord order, a field value sequence number array in which field valuesequence numbers corresponding to field values of the table data recordsin the predetermined field are stored according to the record numbers,and a field value array in which the field values of the table data arestored according to an order of the field value sequence numberscorresponding to the field values, and further includes a plurality ofprocessors operable to access the shared memory. The informationprocessing method includes:

a step of selecting radix representation of the field value sequencenumbers in response to a range of the field value sequence numbers;

a step of rearranging record numbers in the record number array withrespect to an upper-order digit of the radix representation of the fieldvalue sequence numbers so as to generate an intermediate record numberarray that has sections thereof arranged in an order of values of theupper-order digit;

a step of allocating the sections of the intermediate record numberarray to respective processors; and

a step of rearranging, by each of the processors allocated to therespective sections, the record numbers in the corresponding section ofthe intermediate record number array in an order of values of alower-order digit of the field value sequence numbers.

The second embodiment of the present invention is directed to ashared-memory multiprocessor system that includes a shared memory and aplurality of processors operable to access the shared memory, and thatperforms the information processing method of the present inventiondescribed above. In the shared-memory multiprocessor system of thepresent invention, the shared memory stores a record number array inwhich record numbers of table data records are stored according to apredetermined record order, a field value sequence number array in whichfield value sequence numbers corresponding to field values of the tabledata records in the predetermined field are stored according to therecord numbers, and a field value array in which the field values of thetable data are stored according to an order of the field value sequencenumbers corresponding to the field values. With this arrangement, theshared-memory multiprocessor system of the present invention can utilizethe data management mechanism based on the block information.

Each of the processors includes:

a part to determine a portion of the record number array that is to betaken care of by a corresponding processor;

a part to count numbers of occurrences of the field value sequencenumbers corresponding to the records contained in the portion of therecord number array;

a part to determine a range of the field value sequence numbers that isto be taken care of by a corresponding processor;

a part to convert the numbers of occurrences of the field value sequencenumbers in the range that is taken care of into cumulative numbers in anorder of the field value sequence numbers and in an order of portions ofthe record number array within a range corresponding to the same fieldvalue sequence number; and

a part to utilize as pointers the cumulative numbers of the field valuesequence numbers corresponding to the records contained in the portionof the record number array, thereby storing the record numbers containedin the portion of the record number array in a new record number array.

Since each processor can operate in parallel, parallel processing isachieved with respect to the counting of the numbers of occurrences,with respect to the conversion of the numbers of occurrences intocumulative numbers, and with respect to the generation of a new recordnumber array.

There is a need to carry over the obtained cumulative numbers in theorder of field value sequence numbers when the numbers of occurrences offield value sequence numbers are to be converted into cumulativenumbers. Because of this, the cumulative numbers obtained by the part toconvert the numbers of occurrences into the cumulative numbers in aprocessor taking care of an immediately preceding range of the fieldvalue sequence numbers are referred to by the part to convert thenumbers of occurrences into the cumulative numbers in a processor takingcare of an immediately following range.

In the shared-memory multiprocessor system of the present invention, forthe purpose of utilizing the concept of radix sort with respect to thefield value sequence numbers to achieve the multi-stage parallel sortingof large-scale table data, each processor includes:

a part to select radix representation of the field value sequencenumbers in response to a range of the field value sequence numbers; and

a part to repeat sorting by selecting a digit of interest successivelyfrom a least significant digit to a most significant digit in the radixrepresentation of the field value sequence numbers, by use of the recordnumber array as a current record number array for a first time sortingand by use of a new record number array as a current record number arrayfor a second time sorting and onward.

With this arrangement, parallel sorting is performed sequentially on adigit-by-digit basis from the least significant digit to the mostsignificant digit of the field value sequence numbers. Further, the partto repeat sorting includes:

a part to determine a portion of the record number array that is to betaken care of by a corresponding processor;

a part to count numbers of occurrences of values of the digit ofinterest in the field value sequence numbers corresponding to therecords contained in the portion of the record number array;

a part to determine a range of the values of the digit of interest inthe field value sequence numbers that is to be taken care of by acorresponding processor;

a part to convert the numbers of occurrences of the values of the digitof interest in the field value sequence numbers in the range that istaken care of into cumulative numbers in an order of the values of thedigit of interest in the field value sequence numbers and in an order ofportions of the record number array within a range corresponding to thesame value of the digit of interest in the field value sequence number;and

a part to utilize as pointers the cumulative numbers of the values ofthe digit of interest in the field value sequence numbers correspondingto the records contained in the portion of the record number array,thereby storing the record numbers contained in the portion of therecord number array in a new record number array. With this arrangement,parallel sorting is performed on a digit-by-digit basis with respect tothe field value sequence numbers. According to the present invention, aplurality of processors perform parallel processing with respect to thecounting of the numbers of occurrences, the conversion of the numbers ofoccurrences into cumulative numbers, and the generation of a new recordnumber array in the sorting performed separately for each digit of thefield value sequence numbers.

In order for the plurality of processors to share the task of convertingthe numbers of occurrences into cumulative numbers, in the presentinvention, the cumulative numbers obtained by the part to convert thenumbers of occurrences into the cumulative numbers in a processor takingcare of an immediately preceding range of the digit of interest in thefield value sequence numbers are referred to by the part to convert thenumbers of occurrences into the cumulative numbers in a processor takingcare of an immediately following range.

The shared-memory multiprocessor system of the present invention thatperforms the multi-stage parallel sorting of large-scale table data mayuse at least one processor, or preferably one processor, alone, toperform the conversion of the numbers of occurrences of the values ofthe digit of interest into cumulative numbers. To this end, eachprocessor in the shared-memory multiprocessor system of the presentinvention includes a part to select radix representation of the fieldvalue sequence numbers in response to the range of the field valuesequence numbers, and a part to repeat sorting by selecting a digit ofinterest successively from a least significant digit to a mostsignificant digit in the radix representation of the field valuesequence numbers, by use of the record number array as a current recordnumber array for a first time sorting and by use of a new record numberarray as a current record number array for a second time sorting andonward.

The part to repeat the sorting in each processor includes a part todetermine a portion of the record number array that is taken care of bythe corresponding processor, and a part to count the numbers ofoccurrences of values of the digit of interest in the field valuesequence numbers corresponding to the records contained in the portionof the record number array.

Further, the part to repeat sorting in at least one processor includes apart to convert the numbers of occurrences of the values of the digit ofinterest in the field value sequence numbers into cumulative numbers inan order of the values of the digit of interest in the field valuesequence numbers and in an order of portions of the record number arraywithin a range corresponding to the same value of the digit of interestin the field value sequence number.

Moreover, the above-noted part to repeat sorting further includes a partto utilize as pointers the cumulative numbers of the values of the digitof interest in the field value sequence numbers corresponding to therecords contained in the portion of the record number array, therebystoring the record numbers contained in the portion of the record numberarray in the new record number array.

According to the present invention, there is no need for each processorto determine a range of the values of the digit of interest in the fieldvalue sequence numbers that is to be taken care of by the correspondingprocessor, and, also, there is no need for a plurality of processors toshare the task of converting the numbers of occurrences into cumulativenumbers. The configuration of the shared-memory multiprocessor system isthus simplified.

The third embodiment of the present invention provides a program thatimplements the information processing method as described above.

The fourth embodiment of the present invention provides acomputer-readable record medium that has such program recorded therein.

Advantage of the Invention

The present invention can provide an information processing apparatusthat can perform the high-speed parallel sorting of large-scale tabledata in the shared-memory multiprocessor system environment.

BEST MODE FOR CARRYING OUT THE INVENTION

In the following, various embodiments of the present invention will bedescribed with reference to the accompanying drawings.

[Configuration of Computer System]

FIG. 1 is a schematic diagram showing an embodiment of a computer systemthat performs an information processing method for sorting recordsaccording to the field values of the records in a predetermined fieldaccording to the present invention. As shown in FIG. 1, a computersystem 10 includes p processors (CPU) 12-1, 12-2, . . . , and 12-p forexecuting programs to control the entirety and individual parts of thesystem, a shared memory 14 such as a RAM (Random Access Memory) forstoring work data and the like, a ROM (Read Only Memory) 15 for storingprograms and the like, a fixed storage medium 18 such as a hard disk, aCD-ROM driver 20 for accessing a CD-ROM 19, an interface (I/F) 22disposed between the CD-ROM driver 20 or external network (not shown)and an external terminal for connection, an input apparatus 24 comprisedof a keyboard and mouse, and a CRT display apparatus 26. The CPU 12, theRAM 14, the ROM 16, the external storage medium 18, the I/F 22, theinput apparatus 24, and the display apparatus 26 are connected to eachother via the bus 28. Although not illustrated, each CPU may be providedwith a dedicated local memory.

A program for sorting records according to the field values of therecords in a predetermined field according to the present embodiment maybe stored in the CD-ROM 19 and read by the CD-ROM driver 20, or may bestored in the ROM 16 in advance. Alternatively, the program may be readfrom the CD-ROM 19 for storage in a predetermined area of the externalstorage medium 18. Alternatively, the program may be supplied from anexternal source via the network (not shown), the external terminal, andthe I/F 22.

A shared-memory multiprocessor system according to the presentembodiment may be implemented by the computer system 10 executing aprogram for sorting records according to the field values of the recordsin a predetermined field.

[Information-Block-Based Data Management Mechanism]

FIG. 2 is a drawing showing an example of table data for explaining adata management mechanism. This table data is stored in the computer asa data structure as shown in FIG. 3 by using the data managementmechanism described in International Publication No. WO00/10103described above.

As shown in FIG. 3, an array 301 (hereinafter simply referred to as“OrdSet”) that assigns sequence numbers representing the order ofinternal data to sequence numbers representing the order of individualrecords of table data is provided in which the sequence numbersrepresenting the order of internal data are arranged as values withrespect to the respective table-data records. In this example, all ofthe table data are represented as the internal data, so that the recordnumbers of table data correspond to the sequence numbers representingthe order of internal data.

As for gender, for example, it can be learned from the array OrdSet 301that the sequence number representing the order of internal datacorresponding to record 0 of the table data is “0”. An actual gendervalue of the record having sequence number “0”, which is either “male”or “female”, is obtained by referring to a pointer array 302(hereinafter simply referred to “VNo”) that points to a value list 303(hereinafter simply referred to “VL”) in which actual values are sortedin a predetermined sequence. The pointer array 302 stores pointerspointing to the items of the actual value list 303 according to theorder of sequence numbers stored in the array OrdSet 301. With thisarrangement, the gender field value corresponding to record “0” of thetable data is obtained by: (1) extracting sequence number “0”corresponding to record “0” from the array OrdSet 301; (2) extractingitem “1” corresponding to sequence number “0” from the pointer array 302pointing to the value list; and (3) extracting item “female” from thevalue list 303 that is indicated by item “1” extracted from the pointerarray 302 pointing to the value list.

Any field value can be similarly obtained with respect to other recordsand with respect to age and height.

In this manner, table data is represented by a combination of the valuelist VL and the pointer array VNo pointing to the value list, and thiscombination is referred to as “information block”. In FIG. 3, theinformation blocks regarding gender, age, and height are shown asinformation blocks 308, 309, and 310, respectively.

If a single memory (defined as meaning a single address space for accessdespite one or more physical entities) is used, it suffices for a singlecomputer to store in the memory an array OrdSet for ordered set and avalue list VL and pointer array VNo constituting each information block.In order to maintain a large number of records, however, it is desirableto perform parallel processing on the large number of records since therequired memory size increases in response to the record size.

In the present embodiment, thus, the plurality of processors accessrecord data stored in the shared memory, thereby achieving high-speedsorting based on the parallel processing by the plurality of processors.

[Parallel Sorting]

In the following, a description will be given of a parallel sortingmethod, which is an information processing method for sorting recordsaccording to the field values of the records in a predetermined field ina shared-memory multiprocessor system according to the presentembodiment. FIGS. 4A and 4B are drawings showing a data structure ofsort object. Table data 401 shown in FIG. 4A represents the datastructure of a sort object by use of a matrix format, in which 20records from record 0 to record 19 are included. Each record iscomprised of two fields, i.e., “age” and “area”. A data structure 402shown in FIG. 4B represents a data structure stored in the shared memory14 of the computer system 10. A record number array (OrdSet:representing an ordered set) 403 shown in FIG. 4B stores record numbers0 to 19 in a predetermined order. In this example, the record numbersare stored in the following order: 0 to 19. Age data and area data arestored as an information block 404 and an information block 405,respectively. The age information block 404 includes a field valuesequence number array 406 (which may hereinafter be referred to as VNo:value number) in which field value sequence numbers corresponding to agefield values are stored in the order of record numbers, and furtherincludes a field value array 407 (which may hereinafter be referred toas VL: value list) in which the age field values are stored in the orderof field value sequence numbers corresponding to these field values. Bythe same token, the area information block 405 includes a field valuesequence number array 408 in which field value sequence numberscorresponding to area field values are stored in the order of recordnumbers, and further includes a field value array 409 in which the areafield values are stored in the order of field value sequence numberscorresponding to these field values. The p processors 12-1, . . . , and12-p of the computer system 10 can access these data stored in theshared memory 14.

FIG. 5 is a flowchart showing a parallel sorting method according to anembodiment of the present invention. In this embodiment, the number ofCPUs is four, all of which operate in parallel. It should be noted thatthe number of CPUs in the system and the number of CPUs operating inparallel are not limited to this example. In the following, for the sakeof convenience of explanation, a description will be given of a case inwhich the age field items are sorted in an ascending order of age. Theitems of the age field value array are arranged in an ascending order ofage. The parallel sorting method is comprised of 5 steps, i.e., step 501to step 505.

Step 501: The record number array is divided four-fold, and the dividedportions are allocated to the four CPUs (see FIG. 6).

Step 502: The CPUs operate in parallel to count the numbers ofoccurrences of field value sequence numbers corresponding to the recordscontained in the allocated portion of the record number array (see FIGS.7A and 7B through FIGS. 9A and 9B).

Step 5-3: The range of the field value sequence numbers, i.e., fivevalues from field value sequence number 0 to field value sequence number4, are allocated to the four CPUs. For example, field value sequencenumbers 0 and 1 are allocated to CPU-0, and field value sequence numbers2 through 4 are allocated to CPU-1 through CPU-3, respectively (see FIG.10A).

Step 504: Each of the four CPUs converts the numbers of occurrences ofallocated field value sequence numbers to cumulative numbers in theorder of field value sequence numbers and in the order of the recordnumber array portions within a range corresponding to the same fieldvalue sequence number (see FIGS. 10A and 10B).

Step 505: The four CPUs utilize as pointers the cumulative numbers offield value sequence numbers corresponding to the records contained inthe allocated portions of the record number array, thereby storingrecord numbers contained in the allocated record-number-array portionsin another record number array (see FIGS. 11A and 11B through FIGS. 13Aand 13B).

In the following, each step will be described in detail.

FIG. 6 is a drawing for explaining the initialization step 501 of theparallel sorting method. The four CPUs CPU-0 through CPU-3 are allocatedwith respective four records taken in sequence from the top of therecord number array. For example, CPU-0 takes care of first itemOrdSet[0] to fifth item OrdSet[4] of the record number array (“x” asappear in OrdSet[x] means the subscript of array OrdSet). Count arraysCount-0, Count-1, Count-2, and Count-3 are provided in the shared memory14 for the purpose of counting the numbers of occurrences of field valuesequence numbers, and are assigned to the respective CPUs. The number ofthe Count arrays is identical to the number of CPUs, and the array sizeof each Count array is identical to the size of the VL array. Theelements of the Count arrays are initialized to zero.

FIGS. 7A and 7B through FIGS. 9A and 9B are drawings for explaining thecount-up step 502 of the parallel sorting method. In sub-step 1 of FIG.7A, CPU-0 reads value “0” of OrdSet[0], and uses this read value “0” asa subscript to read value “1” of VNo[0], followed by using this value“1” as a subscript to increment value “0” of Count-0[1] to “1”, forexample. By the same token, CPU-1 reads value “5” of OrdSet[5], and usesthis read value “5” as a subscript to read value “2” of VNo[5], followedby using this value “2” as a subscript to increment value “0” ofCount-1[2] to “1”. The same also applies in the case of CPU-2 and CPU-3.In sub-step 2 of FIG. 7B, CPU-0 reads value “1” of OrdSet[1], and usesthis read value “1” as a subscript to read value “3” of VNo[1], followedby using this value “3” as a subscript to increment value “0” ofCount-0[3] to “1”, for example. The same also applies in the case ofCPU-1, CPU-2, and CPU-3. As shown in FIGS. 8A and 8B and FIG. 9A, eachprocessor reads an element from the array OrdSet allocated thereto, anduses this element as a subscript to read an element from the array VNo,followed by using this read element as a subscript to increment thecorresponding element of the Count array. In the end, the count-upresults as shown in FIG. 9B are obtained. Element Count-0[i] of thearray Count-0 shown in FIGS. 9A and 9B represents the number ofoccurrences of age field value sequence number i corresponding to therecords contained in the range from OrdSet[0] to OrdSet[4] of the arrayOrdSet allocated to CPU-0. For example, Count-0[0] indicates that thenumber of occurrences of field value sequence number 0 within the rangeallocated to CPU-0 is 1, and Count-3[1] indicates that the number ofoccurrences of field value sequence number 1 within the range allocatedto CPU-3 is 2.

FIGS. 10A and 10B are drawings for explaining the accumulation steps 503and 504 of the parallel sorting method. In this example, cumulativenumbers are obtained in an ascending order of field value sequencenumbers in conformity with the ascending order sorting. CPU-0 takes careof the task of obtaining cumulative numbers for the first and secondrows (i.e., field value sequence numbers 0 and 1) of the Count array,and CPU-1 through CPU-3 take care of the task of obtaining cumulativenumbers for the third through fifth rows (i.e., field value sequencenumbers 3 through 5) of the Count array, respectively. As shown in FIG.10A, accumulation is first performed in the horizontal direction (i.e.,with respect to a row having the same subscript), and, then, thecumulative number of the preceding row is added to the cumulativenumbers of the following row, resulting in the obtainment of all thecumulative numbers. It should be noted that accumulation in thehorizontal direction can be performed by each CPU in parallel.

Count[i][j] is used to represent a count value of field value sequencenumber j (0≦j≦q−1) counted up by CPU-i that is the i-th CPU (0≦i≦p−1),and a cumulative number is indicated by Count′[i][i]. Then, theaccumulation process can be described as follows.Count′[0][0]=0Count′[i][0]=Count′[i−1][q−1]+Count[i−1][q]i>1Count′[i][j]=Count′[i][j−1]+Count[i][j−1]j>1

In the cumulative number computation described above, there is a need tocarry over offset Count′[i−1][q−1] from a preceding row to a followingrow. Although the CPUs perform the respective portions of the cumulativenumber computation in this embodiment, one of the processors may beselected to make this processor solely perform the cumulative numbercomputation.

FIG. 10B shows the sequence of the accumulation process in the verticaldirection. For example, in FIG. 10B, the row corresponding to “(1)Count-0: 0” indicates that count value “1” of first element Count-0[0]of the array Count-0 is converted into cumulative number 0. Namely, whena series of count values:

1, 2, 2, 0, 2, 0, 2, 2, 0, 2, 0, 1, 1, 1, 0, 1, 1, 0, 1, 1

is converted into cumulative numbers, the result will be obtained asfollows.

0, 1, 3, 5, 5, 7, 7, 9, 11, 11, 13, 13, 14, 15, 16, 16, 17, 18, 18, 19

FIGS. 11A and 11B through FIGS. 13A and 13B are drawings for explainingthe transfer step 505 that stores record numbers in a new record numberarray. In the transfer step, each CPU reads a record number belonging tothe allocated range from the record number array OrdSet, and uses thisrecord number as a subscript to read a field value sequence number fromthe pointer array VNo, followed by using this field value sequencenumber as a subscript to read a cumulative number from the Count arraythat is associated with the CPU and contains cumulative numbers. Theobtained cumulative number is used as a pointer to store the recordnumber in a new record number array OrdSet′, followed by incrementingthe cumulative number of the Count array by 1.

In sub-step 1 of FIG. 11A, for example, CPU-0 reads value “0” (i.e.,record number 0) of OrdSet[0], and then reads value “1” of VNo[0],followed by reading value “5” of Count-0[1] of the assigned Count array,and then setting record number 0 in OrdSet[5], with an increment of thevalue of Count-0[1] to 6. This transfer process for record numbers aresimilarly performed as shown in sub-step 2 of FIG. 11B, sub-steps 3 and4 of FIGS. 12A and 12B, and sub-step 5 of FIG. 13A. In the end, the newrecord number array OrdSet′ shown in FIG. 13B is obtained.

FIGS. 14A through 14C and FIGS. 15A and 15B are drawings showingoutcomes obtained by the parallel sorting method according to theembodiment of the present invention applied to the data structure shownin FIG. 4B. In this example, ascending-order sorting is performed withrespect to the age field, so that the obtained record number arrayOrdSet′ includes records that are arranged in an ascending order of age,and have age field values 16, 18, 20, 21, and 23. Further, the order ofrecords corresponding to the same age remains the same as the order ofrecords as appear in the original record number array OrdSet.

The parallel sorting method has been described with reference to anexample of ascending order sorting regarding age, but may as well beapplied to descending order sorting regarding age. Descending ordersorting is performed similarly to ascending order sorting, but isdifferent from ascending order sorting with regard to the order ofaccumulation process. FIGS. 16A and 16B are drawings for explaining theaccumulation step of the parallel sorting method according to theembodiment of the present invention. As shown in FIG. 16A, accumulationis first performed in the horizontal direction (i.e., with respect to arow having the same subscript), and, then, the cumulative number of thefollowing row is added to the cumulative numbers of the preceding row,resulting in the obtainment of all the cumulative numbers. It should benoted that accumulation in the horizontal direction can be performed byeach CPU in parallel.

Count[i][j] is used to represent a count value of field value sequencenumber j (0≦j≦q−1) counted up by CPU-i that is the i-th CPU (0≦i≦p−1),and a cumulative number is indicated by Count′[i][j]. Then, theaccumulation process can be described as follows.Count′[p−1][0]=0Count′[i][0]=Count′[i+1][q−1]+Count[i+1][q]i>1Count′[i][j]=Count′[i][j−1]+Count[i][j−1]j>1

In the cumulative number computation described above, there is a need tocarry over offset Count′[i+1][q−1] from a following row to a precedingrow. Although the CPUs perform the respective portions of the cumulativenumber computation in this embodiment, one of the processors may beselected to make this processor solely perform the cumulative numbercomputation. FIG. 16B shows the sequence of the accumulation process inthe vertical direction. In FIG. 16B, the row corresponding to “(1)Count-0: 4”, for example, indicates that count value “1” of firstelement Count-0[4] of the array Count-0 is converted into cumulativenumber 0.

FIGS. 17A and 17B through FIGS. 19A and 19B are drawings for explainingthe transfer step 505 of the descending order parallel sorting method.In the transfer step, each CPU reads a record number belonging to theallocated range from the record number array OrdSet, and uses thisrecord number as a subscript to read a field value sequence number fromthe pointer array VNo, followed by using this field value sequencenumber as a subscript to read a cumulative number from the Count arraythat is associated with the CPU and contains cumulative numbers. Theobtained cumulative number is used as a pointer to store the recordnumber in a new record number array OrdSet′, followed by incrementingthe cumulative number of the Count array by 1.

FIGS. 20A and 20B and FIGS. 21A through 21C are drawings showingoutcomes obtained by the descending order parallel sorting methodaccording to the embodiment of the present invention applied to the datastructure shown in FIG. 4B. In this example, descending-order sorting isperformed with respect to the age field, so that the obtained recordnumber array OrdSet′ includes records that are arranged in descendingorder of age, and have age field values 23, 21, 20, 18, and 16. Further,the order of records corresponding to the same age remains the same asthe order of records as appear in the original record number arrayOrdSet.

[Parallel Accumulation Computation]

In the following, the accumulation step 504 described in the aboveembodiment will further be described in detail. When the count resultsas shown in FIG. 9B are obtained, the accumulation process as shown inFIGS. 10A and 10B is performed. In order to perform accumulation inparallel, each CPU is assigned with a respective range of field valuesequence numbers. Field value sequence numbers 0 and 1 are assigned toCPU-0, field value sequence number 2 assigned to CPU-1, field valuesequence number 3 assigned to CPU-2, and field value sequence number 4assigned to CPU-3. When the elements of the Count arrays are representedas Count[i][j] as described above (i: CPU number performing counting, j:field value sequence numbers), the respective ranges of the CPUs foraccumulation are as follows.

Range of CPU-0 (field value sequence numbers 0 and 1)

Count[0][0]=1

Count[1][0]=2

Count[2][0]=2

Count[3][0]=0

Count[0][1]=2

Count[1][1]=0

Count[2][1]=2

Count[3][1]=2

Range of CPU-1 (field value sequence number 2)

Count[0][2]=0

Count[1][2]=2

Count[2][2]=0

Count[3][2]=1

Range of CPU-2 (field value sequence number 3)

Count[0][3]=1

Count[1][3]=1

Count[2][3]=0

Count[3][3]=1

Range of CPU-3 (field value sequence number 4)

Count[0][4]=1

Count[1][4]=0

Count[2][4]=1

Count[3][4]=1

With the respective ranges as determined above, each CPU-i calculatesSum[i] that is the sum of counts within the assigned range as follows.

Sum[0]=11

Sum[1]=3

Sum[2]=3

Sum[3]=3

The computation of these sums is parallel processing.

After this, these sums are made to propagate successively from CPU-0 toCPU-3 to obtain aggregated sums Aggr_sum[i] as follows.Aggr_sum[0]=0Aggr_sum[1]=Aggr_sum[0]+Sum[0]=11Aggr_sum[2]=Aggr_sum[1]+Sum[1]=14Aggr_sum[3]=Aggr_sum[2]+Sum[2]=17The aggregated sums are defined such that the first value is 0.

At the end, each CPU-i converts count values into cumulative numberswithin the respective range, and adds derived aggregated sum Aggr_sum[i]to the cumulative numbers so as to obtain final cumulative numbersCount′. The computation of Count′ is also parallel processing. With thisarrangement, the following is obtained.

Range of CPU-0 (field value sequence numbers 0 and 1)Count′[0][0]=0+Aggr_sum[0]=0+0=0Count′[1][0]=Count′[0][0]+Count[0][0]=0+1=1Count′[2][0]=Count′[1][0]+Count[1][0]=1+2=3Count′[3][0]=Count′[2][0]+Count[2][0]=3+2=5Count′[0][1]=Count′[3][0]+Count[3][0]=5+0=5Count′[1][1]=Count′[0][1]+Count[0][1]=5+2=7Count′[2][1]=Count′[1][1]+Count[1][1]=7+0=7Count′[3][1]=Count′[2][1]+Count[2][1]=7+2=9

Range of CPU-1 (field value sequence number 2)Count′[0][2]=0+Aggr_sum[1]=9+2=11Count′[1][2]=Count′[0][2]+Count[0][2]=11+0=11Count′[2][2]=Count′[1][2]+Count[1][2]=11+2=13Count′[3][2]=Count′[2][2]+Count[2][2]=13+0=13

Range of CPU-2 (field value sequence number 3)Count′[0][3]=0+Aggr_sum[2]=0+14=14Count′[1][3]=Count′[0][3]+Count[0][3]=14+1=15Count′[2][3]=Count′[1][3]+Count[1][3]=15+1=16Count′[3][3]=Count′[2][3]+Count[2][3]=16+0=16

Range of CPU-3 (field value sequence number 4)Count′[0][4]=0+Aggr_sum[3]=0+17=17Count′[1][4]=Count′[0][4]+Count[0][4]=17+1=18Count′[2][4]=Count′[1][4]+Count[1][4]=18+0=18Count′[3][4]=Count′[2][4]+Count[2][4]=18+1=19

These results match the results of cumulative number computation shownin FIG. 10B.

[Multi-Stage Parallel Sorting]

The above-described parallel sorting based on counting sort may becombined with the concept of radix sort. When the size of a field valuearray VL is large, i.e., when the number of field value sequence numbersis large, the field value sequence numbers can be expressed by radixrepresentations to perform the above-described parallel sort on adigit-by-digit basis, thereby achieving efficient sorting. In thefollowing, such multi-state parallel sorting method will be described.The multi-stage parallel sorting according to this embodiment performssorting with respect to successive digits of interest by stating fromthe least-significant digit, and performs sorting with respect to themost-significant digit at the end, which concludes the sorting process.

The data structure shown in FIG. 4B that is used in the example of thepreviously described parallel sorting method is also used in an exampleof the multi-stage parallel sorting method according to the embodimentof the present invention. In this embodiment, the number of CPUs isfour, all of which operate in parallel. It should be noted that thenumber of CPUs in the system and the number of CPUs operating inparallel are not limited to this example. In the following, for the sakeof convenience of explanation, a description will be given of a case inwhich the age field items are sorted in an ascending order of age. Theitems of the age field value array are arranged in an ascending order ofage. In the data structure shown in FIG. 4B, the age field valuesequence number VNo can assume values from 0 to 4. When the field valuesequence numbers are divided by use of radix-4, the field value sequencenumbers are broken into two digits comprised of an upper digit and alower digit. Specifically, the lower digit assumes the value that ismodulo (4) of the field value sequence number, and the upper digitassumes the value that is a quotient obtained by dividing the fieldvalue sequence number by 4.

FIG. 22 is a flowchart showing a multi-stage parallel sorting methodaccording to an embodiment of the present invention. The multi-stageparallel sorting method is comprised of 5 steps, i.e., step 2201 to step2205.

Step 2201: A radix (i.e., radix-4 in this example) for field valuesequence numbers is selected according to the range of the field valuesequence numbers, and an initial record number array OrdSet is used as acurrent record number array, with the least-significant digit of thefield value sequence numbers (the value of modulo (4) of a field valuesequence number) being selected as a digit of interest.

Step 2202: The current record number array is divided and allocated tothe four processors.

Step 2203: Each of the four processors counts the numbers of occurrencesof the values of the digit of interest in the field value sequencenumbers corresponding to the records contained in the allocated portionof the record number array.

Step 2204: The range of values of the digit of interest in the fieldvalue sequence numbers is divided and allocated to the four processors.

Step 2205: Each of the four CPUs converts the numbers of occurrences ofthe values of the digit of interest in the allocated field valuesequence numbers to cumulative numbers in the order of values of thedigit of interest in the field value sequence numbers and in the orderof the record number array portions within a range corresponding to thesame value of the digit of interest in the field value sequence numbers.

Step 2206: Each of the four CPUs utilizes as pointers the cumulativenumbers derived from the numbers of occurrences of the values of thedigit of interest in the field value sequence numbers corresponding tothe records contained in the allocated portion of the record numberarray, thereby storing the record numbers contained in the allocatedrecord-number-array portion in a new record number array.

Step 2207: A check is made as to whether sorting is performed up to themost-significant digit of the field value sequence numbers that areexpressed by radix representation, and the multi-stage parallel sortingprocess comes to an end if sorting has been performed up to themost-significant digit.

Step 2208: One of the remaining digits, if any, is selected as the digitof interest, and the procedure goes back to step 2202, with the newrecord number array then serving as a current record number array.

In the multi-stage parallel sorting method according to the embodimentof the present invention described above, the sorting process from step2202 to step 2206 is substantially the same as the parallel sortingmethod of the present invention, except that the value of the digit ofinterest in a field value sequence number is used in place of a fieldvalue sequence number.

In the following, the multi-stage parallel sorting method according tothe embodiment of the present invention will be described in detail. Inthis example, the data shown in FIG. 4B is sorted in ascending order ofage by use of four CPUs. The initialization step 2201 makes sortsettings with respect to the value of modulo-4 (MOD 4) of an age fieldvalue sequence number (i.e., the value of the lower digit) as afirst-stage sorting process, and makes sort settings with respect to thevalue of a quotient (DIV 4) obtained by dividing an age field valuesequence number by four as a second-stage sorting process.

In the initialization step 2201, arrays similar to the Count arraysshown in FIG. 6 are prepared. The arrays in this example are used tocount the numbers of occurrences of the values of the digit of currentinterest in field value sequence numbers.

FIGS. 23A and 23B through FIGS. 25A and 25B are drawings for explaininga count step in the first stage of the multi-stage parallel sortingmethod. In sub-step 1 of FIG. 23A, CPU-0 reads value “0” of OrdSet[0],and uses this read value “0” as a subscript to read value “1” of VNo[0],followed by using modulo-4 (MOD 4) value “1” of this value “1” as asubscript to increment value “0” of Count-0[1] to “1”, for example. Bythe same token, CPU-1 reads value “5” of OrdSet[5], and uses this readvalue “5” as a subscript to read value “2” of VNo[5], followed by usingMOD 4 value “2” of this value “2” as a subscript to increment value “0”of Count-1[2] to “1”. Thereafter, sub-step 2 of FIG. 23B, sub-step 3 ofFIG. 24A, sub-step 4 of FIG. 24B, and sub-step 5 of FIG. 25A areperformed, resulting in the count-up outcomes as shown in FIG. 25B.Element Count-0[i] of the array Count-0 shown in FIGS. 23A and 23Bthrough FIGS. 25A and 25B represents the number of occurrences of thevalue i of the digit of interest in an age field value sequence numbercorresponding to the records contained in the range from OrdSet[0] toOrdSet[4] of the array OrdSet allocated to CPU-0. For example,Count-0[0] indicates that the number of occurrences of value “0” of thelower-order digit in field value sequence numbers within the rangeallocated to CPU-0 is 1, and Count-3[1] indicates that the number ofoccurrences of value “1” of the lower-order digit in field valuesequence numbers within the range allocated to CPU-3 is 2.

FIGS. 26A and 26B are drawings for explaining an accumulation step inthe first stage of the multi-stage parallel sorting method. In thisexample, cumulative numbers are obtained in an ascending order of valuesof the lower-order digit in field value sequence numbers in conformitywith the ascending order sorting. CPU-0 takes care of the task ofobtaining cumulative numbers for the first row of the Count array (i.e.,value “0” of the lower-order digit of field value sequence numbers), andCPU-1 through CPU-3 take care of the task of obtaining cumulativenumbers for the second through fourth rows of the Count array (i.e.,values “1” through “3” of the lower-order digit of field value sequencenumbers), respectively. As shown in FIG. 26A, accumulation is firstperformed in the horizontal direction (i.e., with respect to a rowhaving the same subscript), and, then, the cumulative number of thepreceding row is added to the cumulative numbers of the following row,resulting in the obtainment of all the cumulative numbers. It should benoted that accumulation in the horizontal direction may be performed byeach CPU in parallel as previously described, but may as well beperformed by a single CPU.

FIGS. 27A and 27B through FIGS. 29A and 29B are drawings for explaininga transfer step that stores record numbers in a new record number arrayin the first stage of the multi-stage parallel sorting method. In thetransfer step, each CPU reads a record number belonging to the allocatedrange from the record number array OrdSet, and uses this record numberas a subscript to read the value of the lower-order digit of a fieldvalue sequence number from the pointer array VNo, followed by using thislower-order-digit value of the field value sequence number as asubscript to read a cumulative number from the Count array that isassociated with the CPU and contains cumulative numbers. The obtainedcumulative number is used as a pointer to store the record number in anew record number array OrdSet′, followed by incrementing the cumulativenumber of the Count array by 1. FIG. 29B shows a record number arrayOrdSet′ obtained by the above-described transfer step in the firststage.

In the second stage, the record number array OrdSet′ obtained in thefirst stage is used as the initial conditions, and ascending-ordersorting is performed with respect to the upper-order-digit values (DIV-4values) of the age field value sequence numbers.

FIG. 30 is a drawing showing the preparation of Count arrays that ismade by assigning the current record number array OrdSet′ to the fourCPUs in step 2202 of the second stage of the multi-stage parallelsorting method according to the embodiment of the present invention.

FIGS. 31A and 31B through FIGS. 33A and 33B are drawings for explaininga count step in the second stage of the multi-stage parallel sortingmethod. In sub-step 1 of FIG. 31A, CPU-0 reads value “2” of OrdSet′[0],and uses this read value “2” as a subscript to read value “4” of VNo[2],followed by using quotient (DIV4) value “1” obtained by dividing thisvalue “1” by 4 as a subscript to increment value “0” of Count-0[1] to“1”, for example. By the same token, CPU-1 reads value “12” ofOrdSet′[5], and uses this read value “12” as a subscript to read value“4” of VNo[12], followed by using DIV 4 value “1” of this value “4” as asubscript to increment value “0” of Count-1[1] to “1”. Thereafter,sub-step 2 of FIG. 31B, sub-step 3 of FIG. 32A, sub-step 4 of FIG. 32B,and sub-step 5 of FIG. 33A are performed, resulting in the second-stagecount-up outcomes as shown in FIG. 33B. Element Count-0[i] of the arrayCount-0 shown in FIGS. 31A and 31B through FIGS. 33A and 33B representsthe number of occurrences of the upper-order-digit value i in age fieldvalue sequence numbers corresponding to the records contained in therange from OrdSet′[0] to OrdSet′[4] of the array OrdSet′ allocated toCPU-0. For example, Count-0[0] indicates that the number of occurrencesof value “0” of the upper-order digit in field value sequence numberswithin the range allocated to CPU-0 is 4, and Count-3[1] indicates thatthe number of occurrences of value “1” of the upper-order digit in fieldvalue sequence numbers within the range allocated to CPU-3 is 0.

FIG. 34 is a drawing for explaining an accumulation step in the secondstage of the multi-stage parallel sorting method. In this example,cumulative numbers are obtained in an ascending order of values of theupper-order digit in field value sequence numbers in conformity with theascending order sorting. Since the number of the upper-order-digitvalues of field value sequence numbers is reduced to two throughconversion into the multi-stages, CPU-0, for example, takes care ofobtaining cumulative numbers for all the values in this example. Asshown in FIG. 34A, CPU0 obtains cumulative numbers in the followingorder: Count[0][0], Count[1][0], Count[2][0], Count[3][0], Count[0][1],Count[1][1], Count[2][1], and Count[3][1]. In this example, of course,the computation of cumulative numbers may be performed by two CPUs, withupper-order-digit values “0” and “1” of field value sequence numbersbeing assigned to CPU-0 and CPU-1, respectively.

FIGS. 35A and 35B through FIGS. 37A and 37B are drawings for explaininga transfer step that stores record numbers in a new record number arrayin the second stage of the multi-stage parallel sorting method. In thetransfer step, each CPU reads a record number belonging to the allocatedrange from the record number array OrdSet, and uses this record numberas a subscript to read the value of the upper-order digit of a fieldvalue sequence number from the pointer array VNo, followed by using thisupper-order-digit value of the field value sequence number as asubscript to read a cumulative number from the Count array that isassociated with the CPU and contains cumulative numbers. The obtainedcumulative number is used as a pointer to store the record number in anew record number array OrdSet″, followed by incrementing the cumulativenumber of the Count array by 1. FIG. 37B shows a record number arrayOrdSet″ obtained by the above-described transfer step in the secondstage.

The multi-stage parallel sorting method of this embodiment is comprisedof the two stages directed to the lower-order digit and upper-orderdigit of field value sequence numbers, so that no more sorting processwill be performed. Namely, the record number array OrdSet″ obtained inthe second step is the results of ascending-order sorting regarding ageperformed with respect to the initial record number array OrdSet.

FIGS. 38A through 38C and FIGS. 39A and 39B are drawings showingoutcomes obtained by the multi-stage ascending-order parallel sortingmethod according to the embodiment of the present invention applied tothe data structure shown in FIG. 4B. In this example, ascending-ordersorting is performed with respect to the age field, so that the obtainedrecord number array OrdSet″ includes records that are arranged in anascending order of age, and have age field values 16, 18, 20, 21, and23. Further, the order of records corresponding to the same age remainsthe same as the order of records as appear in the original record numberarray OrdSet. These results match the results that are obtained byapplying the ascending-order parallel sorting method according to theembodiment of the present invention shown in FIGS. 14A through 14C andFIGS. 15A and 15B to the data structure shown in FIG. 4B.

Although the multi-stage parallel sorting method described above isdirected to ascending-order sorting, the multi-stage parallel sortingmethod of the present invention may as well be applicable todescending-order sorting. As previously described, the computation ofcumulative numbers in each stage of multi-stage parallel sorting may beperformed in parallel by a plurality of processors, or may be performedby at least one processor, preferably by a single processor alone.

[Multi-Stage Sorting]

The multi-stage parallel sorting described above performs sorting withrespect to successive digits of interest by stating from theleast-significant digit, and performs sorting with respect to themost-significant digit at the end, which concludes the sorting process.Alternatively, a sorting process may be performed with respect tosuccessive digits of interest by stating from the most-significantdigit, and may be performed with respect to the least-significant digitat the end, which concludes the sorting process. In the following, abrief description will be given of a method by which a sorting processis converted into multi-stages in sequence from the most-significantorder to the least-significant order.

In this example, a data structure as shown in FIG. 40 is utilized.Further, the number of CPUs is one in this example. In the following, adescription will be given of a case in which the age field items aresorted in ascending order of age. The total number of records is 20 fromrecord number “0” to record number “19”, and there are 9 field valuesequence numbers from “0” to “8”. Namely, the actual age is one of the 9values, i.e., 15, 16, 18, 19, 20, 21, 23, 25 and 28. In the datastructure shown in FIG. 40, the age field value sequence number VNo canassume values from 0 to 8. When the field value sequence numbers aredivided by use of radix-4, a quotient obtained by dividing a field valuesequence number by four is an upper-order-digit value, and a module-4value of the field value sequence number is a lower-order-digit value.The upper-order digit of a field value sequence number assumes one ofthe three values “0”, “1”, and “2”, and the lower-order digit assumesone of the four values “0”, “1”, “2”, and “3”.

First, an array Count-i for counting the numbers of occurrences of theupper-digit values “0”, “1”, and “2” is prepared and initialized bysetting “0” to all the elements thereof in the first stage. For example,Count-1[0] is an area used to count the number of records for which theupper-order value of the field value sequence number is 0.

Next, starting from the first element (i.e., record) in the recordnumber array OrdSet, the corresponding field value sequence numbers aresuccessively read from the array VNo, and a quotient obtained bydividing each field value sequence number by four is used as a pointerto increment the value of an element in the array Count-1. FIGS. 41Athrough 41D are drawings for exampling an example in which theupper-order-digit values of field value sequence numbers are computedfor the three record numbers OrdSet[0]=0, OrdSet[7]=7, OrdSet[19]=19,and the corresponding counters are counted up, followed by anaccumulation process. As shown in FIG. 41C, the count-up process of thefirst stage indicates that the number of records havingupper-order-digit value “0” of a field value sequence number is 12, thatthe number of records having upper-order-digit value “1” is 7, and thatthe number of records having upper-order-digit value “2” is 1. As shownin FIG. 41D, the counted values are accumulated.

After this, array Aggr-1 in which the numbers of occurrences ofupper-order-digit values of field value sequence numbers are accumulatedis used to covert the record number array OrdSet into a new recordnumber array OrdSet′. Specifically, VNo[j] is read if OrdSet[i]=j, andAggr-1[k] is read where k is the quotient (VNo[j] DIV 4) obtained bydividing VNo[j] by four, followed by setting record number j toOrdSet[Aggr-1[k]], and then incrementing Aggr-1[k]. FIGS. 42A and 42Bare illustrative drawing for explaining a record number transfer processin such multi-stage sorting. FIG. 42A shows the transfer of OrdSet[0],and FIG. 42B shows the transfer of OrdSet[19]. FIG. 43 shows the recordnumber array OrdSet′ containing the results of the first-stage recordnumber transfer and a range of distribution of upper-order-digit values.The records having upper-order-digit value “0” are distributed withinthe range (section 0) from OrdSet′[0] to OrdSet′[11] of the recordnumber array OrdSet′, and the records having upper-order-digit value “1”are distributed within the range (section 1) from OrdSet′[12] toOrdSet′[18] of the record number array OrdSet′, with the records havingupper-order-digit value “2” being present in OrdSet′[19] (section 2) ofthe record number array OrdSet′.

In the second stage of the multi-stage parallel sorting, the recordnumbers are sorted in each section according to the lower-order-digitvalues of field value sequence numbers. Section 1 of OrdSet′, forexample, is transferred to corresponding section 1 of OrdSet″. Insecond-stage sorting, no record number is transferred to outside thesection because the section is already determined based on theupper-order digit.

FIG. 44 is a drawing showing the initial conditions of the second stageof the multi-stage sorting. In the following, section 1 of OrdSet′ willbe described. When a plurality of processors are present, theseprocessors may be assigned to respective sections, so that the followingprocess is performed in parallel. Count-2 is used to count the numbersof occurrences of the lower-order-digit values (0, 1, 2, 3) of the fieldvalue sequence numbers in section 1.

FIGS. 45A through 45C are drawings for explaining counting-up andaccumulation performed in the second stage of the multi-stage sorting. Acount-up array as shown in FIG. 45B is obtained by successivelyperforming a count-up by starting from FIG. 45A. This count-up array issubjected to accumulation as shown in FIG. 45C.

Finally, the second cumulative-number array Aggr-2 is used as pointersto transfer section 1 of the record number array OrdSet′ to section 1 ofthe record number array OrdSet″, with which the multi-stage sortingcomes to an end. FIGS. 46A and 46B are drawings for explainingrecord-number transfer in the second stage of the multi-stage sorting.Specifically, VNo[j] is read if OrdSet′[i]=j, and Aggr-2[k] is readwhere k is the remainder (VNo[j] MOD 4) obtained by dividing VNo[j] byfour, followed by setting record number j to OrdSet″[Aggr-2[k]], andthen incrementing Aggr-2[k]. FIG. 46A shows the transfer of OrdSet′[14],and FIG. 46B shows the transfer of OrdSet′[18]. Section 1 of OrdSet″shown in FIG. 46B demonstrates the final results of sorting obtained forsection 1.

Similarly to section 1, the second-stage counting-up, accumulation, andrecord-number transfer are performed with respect to section 0 andsection 2, so that the entirety of the record number array OrdSet istransferred to the record number array OrdSet″. With this, the sortingcomes to an end.

As previously described, according to the embodiment of the presentinvention, the computer system 10 executes a program for sorting recordsaccording to the field values of the records in a predetermined field.More specifically, the program according to the present embodimentcauses each CPU to perform the above-described process steps or toperform the above-described functions as will be described in thefollowing.

In the present embodiment, the computer system 10 may be provided withan OS (e.g., Linux: registered trademark). At the initial state, a givenCPU (e.g., CPU 12-1) loads the program to memory (e.g., the sharedmemory 14) under the control of the OS. As the program is loaded to thememory, the CPUs 12-1, 12-2, . . . , and 12-p operate under the controlof the OS to perform a predetermined function when each of the CPUs issupposed to perform processing. Namely, each CPU reads predeterminedprocess steps of the program stored in the shared memory 14, and executethese process steps. On the other hand, when a specific CPU is supposedto perform processing, this specific CPU operates under the control ofthe OS to perform another predetermined function. Namely, only thespecific CPU reads other predetermined process steps of the programstored in the shared memory 14, and execute these other process steps.The storage location for the program executed by each CPU is not limitedto the shared memory 14, but may be a local memory (not shown)associated with each CPU.

In this manner, the program causes each CPU to perform a predeterminedfunction under the control of the OS according to the presentembodiment, and may cause a specific CPU to perform anotherpredetermined function according to need.

The present invention is not limited to the embodiments described above,and may be subject to various modifications within the scope of theinvention described in the claims, with explicit understanding that suchmodifications are within the scope of the present invention.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a schematic diagram of a computer system according to anembodiment of the present invention.

FIG. 2 is a drawing showing an example of table data for explaining adata management mechanism.

FIG. 3 is an illustrative drawing for explaining the data managementmechanism according to the embodiment of the present invention.

FIGS. 4A and 4B are drawings for explaining a data structure subjectedto sorting according to the embodiment of the present invention.

FIG. 5 is a flowchart showing a parallel sorting method according to anembodiment of the present invention.

FIG. 6 is an illustrative drawing for explaining an initialization stepof the parallel sorting method according to the embodiment of thepresent invention.

FIGS. 7A and 7B are drawings (1) for explaining the count-up step of theparallel sorting method according to the embodiment of the presentinvention.

FIGS. 8A and 8B are drawings (2) for explaining the count-up step of theparallel sorting method according to the embodiment of the presentinvention.

FIGS. 9A and 9B are drawings (3) for explaining the count-up step of theparallel sorting method according to the embodiment of the presentinvention.

FIGS. 10A and 10B are drawings for explaining the accumulation step ofthe ascending-order parallel sorting method according to the embodimentof the present invention.

FIGS. 11A and 11B are drawings (1) for explaining the transfer step ofthe ascending-order parallel sorting method according to the embodimentof the present invention.

FIGS. 12A and 12B are drawings (2) for explaining the transfer step ofthe ascending-order parallel sorting method according to the embodimentof the present invention.

FIGS. 13A and 13B are drawings (3) for explaining the transfer step ofthe ascending-order parallel sorting method according to the embodimentof the present invention.

FIGS. 14A through 14C are drawings (1) showing outcomes obtained by theascending order parallel sorting method according to the embodiment ofthe present invention applied to the data structure shown in FIG. 4B.

FIGS. 15A and 15B are drawings (2) showing outcomes obtained by theascending order parallel sorting method according to the embodiment ofthe present invention applied to the data structure shown in FIG. 4B.

FIGS. 16A and 16B are drawings for explaining the accumulation step ofthe descending-order parallel sorting method according to the embodimentof the present invention.

FIGS. 17A and 17B are drawings (1) for explaining the transfer step ofthe descending-order parallel sorting method according to the embodimentof the present invention.

FIGS. 18A and 18B are drawings (2) for explaining the transfer step ofthe descending-order parallel sorting method according to the embodimentof the present invention.

FIGS. 19A and 19B are drawings (3) for explaining the transfer step ofthe descending-order parallel sorting method according to the embodimentof the present invention.

FIGS. 20A and 20B are drawings (1) showing outcomes obtained by thedescending order parallel sorting method according to the embodiment ofthe present invention applied to the data structure shown in FIG. 4B.

FIGS. 21A through 21C are drawings (2) showing outcomes obtained by thedescending order parallel sorting method according to the embodiment ofthe present invention applied to the data structure shown in FIG. 4B.

FIG. 22 is a flowchart showing a multi-stage parallel sorting methodaccording to an embodiment of the present invention.

FIGS. 23A and 23B are drawings (1) for explaining a count-up step in thefirst stage of the multi-stage parallel sorting method according to theembodiment of the present invention.

FIGS. 24A and 24B are drawings (2) for explaining the count-up step inthe first stage of the multi-stage parallel sorting method according tothe embodiment of the present invention.

FIGS. 25A and 25B are drawings (3) for explaining the count-up step inthe first stage of the multi-stage parallel sorting method according tothe embodiment of the present invention.

FIGS. 26A and 26B are drawings for explaining an accumulation step inthe first stage of the multi-stage ascending-order parallel sortingmethod according to the embodiment of the present invention.

FIGS. 27A and 27B are drawings (1) for explaining a transfer step in thefirst stage of the multi-stage ascending-order parallel sorting methodaccording to the embodiment of the present invention.

FIGS. 28A and 28B are drawings (2) for explaining the transfer step inthe first stage of the multi-stage ascending-order parallel sortingmethod according to the embodiment of the present invention.

FIGS. 29A and 29B are drawings (3) for explaining the transfer step inthe first stage of the multi-stage ascending-order parallel sortingmethod according to the embodiment of the present invention.

FIG. 30 is an illustrative drawing for explaining an initialization stepof the second stage of the multi-stage parallel sorting method accordingto the embodiment of the present invention.

FIGS. 31A and 31B are drawings (1) for explaining a count-up step in thesecond stage of the multi-stage parallel sorting method according to theembodiment of the present invention.

FIGS. 32A and 32B are drawings (2) for explaining the count-up step inthe second stage of the multi-stage parallel sorting method according tothe embodiment of the present invention.

FIGS. 33A and 33B are drawings (3) for explaining the count-up step inthe second stage of the multi-stage parallel sorting method according tothe embodiment of the present invention.

FIG. 34 is an illustrative drawing for explaining an accumulation stepof the second stage of the multi-stage ascending-order parallel sortingmethod according to the embodiment of the present invention.

FIGS. 35A and 35B are drawings (1) for explaining a transfer step in thesecond stage of the multi-stage ascending-order parallel sorting methodaccording to the embodiment of the present invention.

FIGS. 36A and 36B are drawings (2) for explaining the transfer step inthe second stage of the multi-stage ascending-order parallel sortingmethod according to the embodiment of the present invention.

FIGS. 37A and 37B are drawings (3) for explaining the transfer step inthe second stage of the multi-stage ascending-order parallel sortingmethod according to the embodiment of the present invention.

FIGS. 38A through 38C are drawings (1) showing outcomes obtained by themulti-stage ascending order parallel sorting method according to theembodiment of the present invention applied to the data structure shownin FIG. 4B.

FIGS. 39A and 39B are drawings (2) showing outcomes obtained by themulti-stage ascending order parallel sorting method according to theembodiment of the present invention applied to the data structure shownin FIG. 4B.

FIG. 40 is a data structure diagram for explaining multi-stage sorting.

FIGS. 41A through 41D are drawings for explaining counting-up andaccumulation performed in the first stage of the multi-stage sorting.

FIGS. 42A and 42B are drawings for explaining record-number transfer inthe first stage of the multi-stage sorting.

FIG. 43 is a drawing for explaining the results of record-numbertransfer in the first stage of the multi-stage sorting.

FIG. 44 is a drawing showing the initial conditions of the second stageof the multi-stage sorting.

FIGS. 45A through 45C are drawings for explaining counting-up andaccumulation performed in the second stage of the multi-stage sorting.

FIGS. 46A and 46B are drawings for explaining record-number transfer inthe second stage of the multi-stage sorting.

DESCRIPTION OF REFERENCE NUMBERS

-   10 Computer System-   12-1, 12-2, . . . , 12-p CPU-   14 Shared Memory-   16 ROM-   18 Fixed Storage Device-   20 CD-ROM Driver-   22 I/F-   24 Input Apparatus-   26 Display Apparatus

1. An information processing method of rearranging an order of recordsaccording to field values of the records in a predetermined field in ashared-memory multiprocessor system including a shared memory to store arecord number array in which record numbers of table data records arestored according to a predetermined record order, a field value sequencenumber array in which field value sequence numbers corresponding tofield values of the table data records in the predetermined field arestored in such a manner as to be associated with the record numbers, anda field value array in which the field values of the table data arestored according to an order of the field value sequence numberscorresponding to the field values, and further including n (n≧1)processors operable to access the shared memory, said informationprocessing method comprising: a step of dividing the record number arrayinto n1 (n1≦n) portions to allocate the n1 divided portions of therecord number array to n1 processors among the n processors,respectively; a step of counting, by each of the n1 processors, numbersof occurrences of the field value sequence numbers associated with therecord numbers contained in an allocated portion of the record numberarray; a step of dividing a range of the field value sequence numbersinto n2 (n2≦n) ranges to allocate the n2 divided ranges of the fieldvalue sequence numbers to n2 processors among the n processors,respectively; a step of converting, by each of the n2 processors, therespective numbers of occurrences of the field value sequence numberscounted by the n1 processors into cumulative numbers in an order of thefield value sequence numbers where the field value sequence numbers aredifferent from each other and in an order of the portions of the recordnumber array where two or more processors have counted the numbers ofoccurrences of a common field value sequence number; and a step ofutilizing, by each of the n1 processors, as pointers the cumulativenumbers of the field value sequence numbers associated with the recordnumbers contained in the allocated portion of the record number array,thereby storing the record numbers contained in the allocated portion ofthe record number array in a new record number array.
 2. An informationprocessing method of rearranging an order of records according to fieldvalues of the records in a predetermined field in a shared-memorymultiprocessor system including a shared memory to store a record numberarray in which record numbers of table data records are stored accordingto a predetermined record order, a field value sequence number array inwhich field value sequence numbers corresponding to field values of thetable data records in the predetermined field are stored in such amanner as to be associated with the record numbers, and a field valuearray in which the field values of the table data are stored accordingto an order of the field value sequence numbers corresponding to thefield values, and further including n (n≧1) processors operable toaccess the shared memory, said information processing method comprising:a step of dividing the record number array into n1 (n1≦n) portions toallocate the n1 divided portions of the record number array to n1processors among the n processors; a step of counting, by each of the n1processors, numbers of occurrences of the field value sequence numbersassociated with the record numbers contained in an allocated portion ofthe record number array; a step of dividing a range of the field valuesequence numbers into n2 (n2≦n) ranges to allocate the n2 divided rangesof the field value sequence numbers to n2 processors among the nprocessors; a step of using each of the n2 processors to perform, withrespect to the field value sequence numbers allocated to the n2processors, (i) deriving a sum of the numbers of occurrences counted byeach of the n1 processors and carrying over the sum between the n2processors in an order of the ranges of the field value sequencenumbers, and (ii) converting the numbers of occurrences into cumulativenumbers in an order of the field value sequence numbers where the fieldvalue sequence numbers are different from each other and in an order ofthe portions of the record number array where two or more processorshave counted the numbers of occurrences of a common field value sequencenumber, followed by adding the carried over sum to the cumulativenumbers so as to covert the number of occurrences into cumulativenumbers separately for each of the field value sequence numbersassociated with the record numbers contained in the respective portionof the record number array allocated to each of the n1 processors; and astep of utilizing, by each of the n1 processors, as pointers thecumulative numbers obtained separately for each of the field valuesequence numbers associated with the record numbers contained in theallocated portion of the record number array, thereby storing the recordnumbers contained in the allocated portion of the record number array ina new record number array.
 3. An information processing method ofrearranging an order of records according to field values of the recordsin a predetermined field in a shared-memory multiprocessor systemincluding a shared memory to store a record number array in which recordnumbers of table data records are stored according to a predeterminedrecord order, a field value sequence number array in which field valuesequence numbers corresponding to field values of the table data recordsin the predetermined field are stored in such a manner as to beassociated with the record numbers, and a field value array in which thefield values of the table data are stored according to an order of thefield value sequence numbers corresponding to the field values, andfurther including n (n≧1) processors operable to access the sharedmemory, said information processing method comprising: a step ofselecting, by at least one processor, radix representation of the fieldvalue sequence numbers in response to a range of the field valuesequence numbers so as to divide the field value sequence numbers intoan upper-order digit and a lower-order digit; a step of counting, by atleast one processor, numbers of occurrences of values of the upper-orderdigit of the field value sequence numbers associated with the recordnumbers contained in the record number array, converting the numbers ofoccurrences into cumulative numbers according to an order of the valuesof the upper-order digit of the field value sequence numbers, andrearranging the record numbers in the record number array by utilizingas pointers the cumulative numbers of the values of the upper-orderdigit of the field value sequence numbers, thereby generating anintermediate record number array which are partitioned into n1 (sn)sections according to the order of the values of the upper-order digitof the field value sequence numbers; a step of allocating, by at leastone processor, the n1 sections of the intermediate record number arrayto n1 processors among the n processors, respectively: and a step ofcounting, by each of the processors allocated to the respectivesections, numbers of occurrences of values of the lower-order digit ofthe field value sequence numbers associated with the record numberscontained in an allocated section of the intermediate record numberarray, converting the numbers of occurrences into cumulative numbersaccording to an order of the values of the lower-order digit of thefield value sequence numbers, and rearranging the record numbers in theallocated section of the intermediate record number array in the orderof the values of the lower order digit of the associated field valuesequence numbers by utilizing as pointers the cumulative numbers of thevalues of the lower-order digit of the field value sequence numbers. 4.A shared-memory multiprocessor system comprising a shared memory and n(n≧1) processors operable to access the shared memory, wherein theshared memory stores a record number array in which record numbers oftable data records are stored according to a predetermined record order,a field value sequence number array in which field value sequencenumbers corresponding to field values of the table data records in thepredetermined field are stored in such a manner as to be associated withthe record numbers, and a field value array in which the field values ofthe table data are stored according to an order of the field valuesequence numbers corresponding to the field values, and each of theprocessors includes: a part to determine which one of n1 (n1≦n) portionsof the record number array is to be of processed by a correspondingprocessor; a part to count numbers of occurrences of the field valuesequence numbers associated with the record numbers contained in theportion of the record number array; a part to determine which one of n2(n2≦n) ranges of the field value sequence numbers is to be processed bya corresponding processor; a part to convert respective numbers ofoccurrences of the field value sequence numbers contained in the rangeof processed by the corresponding processor into cumulative numbers inan order of the field value sequence numbers where the field valuesequence numbers are different from each other and in an order of theportions of the record number array where two or more processors havecounted the numbers of occurrences of a common field value sequencenumber; and a part to utilize as pointers the cumulative numbers of thefield value sequence numbers associated with the record numberscontained in the portion of the record number array, thereby storing therecord numbers contained in the portion of the record number array in anew record number array.
 5. The shared-memory multiprocessor system asclaimed in claim 4, wherein the cumulative numbers obtained by the partto convert the numbers of occurrences into the cumulative numbers in aprocessor processing an immediately preceding range of the field valuesequence numbers are referred to by the part to convert the numbers ofoccurrences into the cumulative numbers in a processor processing animmediately following range.
 6. A program for use in a shared-memorymultiprocessor system including a shared memory to store a record numberarray in which record numbers of table data records are stored accordingto a predetermined record order, a field value sequence number array inwhich field value sequence numbers corresponding to field values of thetable data records in the predetermined field are stored in such amanner as to be associated with the record numbers, and a field valuearray in which the field values of the table data are stored accordingto an order of the field value sequence numbers corresponding to thefield values, and further including n (n≧1) processors operable toaccess the shared memory, said program causing each of the processors toperform: a function to determine which one of n1 (n1≦n) portions of therecord number array is to be processed by a corresponding processor; afunction to count numbers of occurrences of the field value sequencenumbers associated with the record numbers contained in the portion ofthe record number array; a function to determine which one of n2 (n2≦n)ranges of the field value sequence numbers is to be processed by acorresponding processor; a function to convert respective numbers ofoccurrences of the field value sequence numbers contained in the rangeprocessed by the corresponding processor into cumulative numbers in anorder of the field value sequence numbers where the field value sequencenumbers are different from each other and in an order of the portions ofthe record number array where two or more processors have counted thenumbers of occurrences of a common field value sequence number; and afunction to utilize as pointers the cumulative numbers of the fieldvalue sequence numbers associated with the record numbers contained inthe portion of the record number array, thereby storing the recordnumbers contained in the portion of the record number array in a newrecord number array.
 7. A computer-readable record medium having aprogram of claim 6 recorded therein.
 8. A program for use in ashared-memory multiprocessor system including a shared memory to store arecord number array in which record numbers of table data records arestored according to a predetermined record order, a field value sequencenumber array in which field value sequence numbers corresponding tofield values of the table data records in the predetermined field arestored in such a manner as to be associated with the record numbers, anda field value array in which the field values of the table data arestored according to an order of the field value sequence numberscorresponding to the field values, and further including a plurality ofprocessors operable to access the shared memory, said program causingeach of the processors to perform: a function to select radixrepresentation of the field value sequence numbers in response to arange of the field value sequence numbers; a function to control sortingof a digit of interest by selecting the digit of interest successivelyfrom a least significant digit to a most significant digit in the radixrepresentation of the field value sequence numbers, by use of the recordnumber array as a current record number array for a first time sortingand by use of a new record number array as a current record number arrayfor a second time sorting and onward; a function to determine a portionof the record number array that is to be processed by a correspondingprocessor; a function to count numbers of occurrences of values of thedigit of interest in the field value sequence numbers associated withthe record numbers contained in the portion of the record number array;a function to determine a range of the values of the digit of interestin the field value sequence numbers that is to be processed by acorresponding processor; a function to convert the respective numbers ofoccurrences of the values of the digit of interest of the allocatedfield value sequence numbers within the range processed by thecorresponding processor into cumulative numbers in an order of thevalues of the digit of interest of the field value sequence numberswhere the values of the digit of interest of the field value sequencenumbers are different from each other and in an order of the portions ofthe record number array where two or more processors have counted thenumbers of occurrences of a common value of the digit of interest of thefield value sequence numbers; and a function to utilize as pointers thecumulative numbers of the values of the digit of interest in the fieldvalue sequence numbers associated with the record numbers contained inthe portion of the record number array, thereby storing the recordnumbers contained in the portion of the record number array in a newrecord number array.
 9. A program for use in a shared-memorymultiprocessor system including a shared memory to store a record numberarray in which record numbers of table data records are stored accordingto a predetermined record order, a field value sequence number array inwhich field value sequence numbers corresponding to field values of thetable data records in the predetermined field are stored in such amanner as to be associated with the record numbers, and a field valuearray in which the field values of the table data are stored accordingto an order of the field value sequence numbers corresponding to thefield values, and further including a plurality of processors operableto access the shared memory, said program causing each of the processorsto perform: a function to select radix representation of the field valuesequence numbers in response to a range of the field value sequencenumbers; a function to control sorting of a digit of interest byselecting the digit of interest successively from a least significantdigit to a most significant digit in the radix representation of thefield value sequence numbers, by use of the record number array as acurrent record number array for a first time sorting and by use of a newrecord number array as a current record number array for a second timesorting and onward; a function to determine a portion of the recordnumber array that is to be processed by a corresponding processor; and afunction to count numbers of occurrences of values of the digit ofinterest in the field value sequence numbers associated with the recordnumbers contained in the portion of the record number array, wherein atleast one processor is caused by the program to perform the function toconvert the respective numbers of occurrences of the values of the digitof interest of the field value sequence numbers into cumulative numbersin an order of the values of the digit of interest of the field valuesequence numbers where the values of the digit of interest of the fieldvalue sequence numbers are different from each other and in an order ofthe portions of the record number array where two or more processorshave counted the numbers of occurrences of a common value of the digitof interest of the field value sequence numbers, and wherein said eachof the processors are caused by the program to further perform afunction to utilize as pointers the cumulative numbers of the values ofthe digit of interest in the field value sequence numbers associatedwith the record numbers contained in the portion of the record numberarray, thereby storing the record numbers contained in the portion ofthe record number array in a new record number array.
 10. A program foruse in a shared-memory multiprocessor system including a shared memoryto store a record number array in which record numbers of table datarecords are stored according to a predetermined record order, a fieldvalue sequence number array in which field value sequence numberscorresponding to field values of the table data records in thepredetermined field are stored in such a manner as to be associated withthe record numbers, and a field value array in which the field values ofthe table data are stored according to an order of the field valuesequence numbers corresponding to the field values, and furtherincluding n (n≧1) processors operable to access the shared memory, saidprogram causing: each of n1 processors among the n processors, to whichn1 (n1 sn) divided portions of the record number array are allocated, toperform a function to count numbers of occurrences of the field valuesequence numbers associated with the record numbers contained in anallocated portion of the record number array; each of n2 processorsamong the n processors, to which n2 (n2≦n) divided ranges of a range ofthe field value sequence numbers are allocated, to perform a functionof, with respect to the field value sequence numbers allocated to the n2processors, (i) deriving a sum of the numbers of occurrences counted byeach of the n1 processors and carrying over the sum between the n2processors in an order of the ranges of the field value sequencenumbers, and (ii) converting the numbers of occurrences into cumulativenumbers in an order of the field value sequence numbers where the fieldvalue sequence numbers are different from each other and in an order ofthe portions of the record number array where two or more processorshave counted the numbers of occurrences of a common field value sequencenumber, followed by adding the carried over sum to the cumulativenumbers so as to covert the number of occurrences into cumulativenumbers separately for each of the field value sequence numbersassociated with the record numbers contained in the respective portionof the record number array allocated to each of the n1 processors; andeach of the n1 processors to perform a function to utilize as pointersthe cumulative numbers obtained separately for each of the field valuesequence numbers associated with the record numbers contained in theallocated portion of the record number array, thereby storing the recordnumbers contained in the allocated portion of the record number array ina new record number array.
 11. A program for use in a shared-memorymultiprocessor system including a shared memory to store a record numberarray in which record numbers of table data records are stored accordingto a predetermined record order, a field value sequence number array inwhich field value sequence numbers corresponding to field values of thetable data records in the predetermined field are stored in such amanner as to be associated with the record numbers, and a field valuearray in which the field values of the table data are stored accordingto an order of the field value sequence numbers corresponding to thefield values, and further including n (n≧1) processors operable toaccess the shared memory, said program causing at least one processor toperform: a function to select radix representation of the field valuesequence numbers in response to a range of the field value sequencenumbers so as to divide the field value sequence numbers into anupper-order digit and a lower-order digit; and a function to countnumbers of occurrences of values of the upper-order digit of the fieldvalue sequence numbers associated with the record numbers contained inthe record number array, to convert the numbers of occurrences intocumulative numbers according to an order of the values of theupper-order digit of the field value sequence numbers, and to rearrangethe record numbers in the record number array by utilizing as pointersthe cumulative numbers of the values of the upper-order digit of thefield value sequence numbers, thereby generating an intermediate recordnumber array which are partitioned into n1 (≦n) sections according tothe order of the values of the upper-order digit of the field valuesequence numbers, wherein each of a plurality of processors allocated tothe respective sections of the intermediate record number array iscaused by the program to perform a function to count numbers ofoccurrences of values of the lower-order digit of the field valuesequence numbers associated with the record numbers contained in anallocated section of the intermediate record number array, to convertthe numbers of occurrences into cumulative numbers according to an orderof the values of the lower-order digit of the field value sequencenumbers, and to rearrange the record numbers in the allocated section ofthe intermediate record number array in the order of the values of thelower-order digit of the associated field value sequence numbers byutilizing as pointers the cumulative numbers of the values of thelower-order digit of the field value sequence numbers.